Audio-visual apparatus with a voice recognition function

ABSTRACT

An audio-visual apparatus includes a voice input section, and a voice recognition section configured to recognize the voices inputted from the voice input section. The audio-visual apparatus also includes a control section configured to instruct components of the apparatus body to perform respective operations for executing a command according to a voice recognized by the voice recognition section, and a storage section. The control section records in the storage section last settings of the apparatus body that are active before the execution of the command and then executes the command. When a specified code associated with a voice representing an UNDO command for undoing the former command is received after the execution of the command, the control section instructs the components of the apparatus body to perform respective UNDO operations for restoring the last settings stored in the storage section from the current settings of the apparatus body.

CROSS REFERENCE

This Nonprovisional application claims priority under 35 U.S.C. § 119(a)on Patent Application No. 2005-179443 filed in Japan on Jun. 20, 2005,the entire contents of which are hereby incorporated by reference.

BACKGROUND OF THE INVENTION

The present invention relates to AV (Audio-Visual) apparatus includingtelevision receivers, optical disk devices and hard disk recorders(hereinafter will be referred to as HDRs) and, more particularly, an AVapparatus with a voice recognition function which is capable ofrecognizing voices of a user and controlling the apparatus body based onthe voices.

The voice recognition technology for recognizing voices inputtedincludes the technique of extracting a word-like portion of a voicebeing uttered, which is so-called word spotting, the technique ofrecognizing successive voices by using HMM (Hidden Marcov Model), andlike techniques.

Conventionally widespread AV apparatus are generally provided with aninfrared wireless remote controller for remote control of such an AVapparatus. Existing AV apparatus of another type adopt theaforementioned voice recognition technology for remote control byvoices. Representatives of such AV apparatus with the voice recognitionfunction include the following television receiver, optical disk deviceand HDR.

For example, when the user utters a voice representing a command topower ON/OFF, change channels or adjust the sound volume or a likecommand, the television receiver receives the voice by means of amicrophone or the like and recognizes the voice. Then, the televisionreceiver executes the command to power ON/OFF, change channels or adjustthe sound volume or the like command according to the voice recognized.

In the case of the optical disk device or the HDR, when the user uttersa voice representing a command to perform a trick play such as replay,fast forward or reverse or a like command, the optical disk device orthe HDR receives the voice by means of a microphone or the like andrecognizes the voice. Then, the optical disk device or the HDR executesthe command to perform replay, fast forward or reverse or a like commandaccording to the voice recognized.

On the other side, an audio apparatus controller with a voicerecognition function has been proposed as described in Japanese PatentLaid-Open Publication No. HEI 6-202690. This audio apparatus controllerincludes a control section configured to determine whether or not acommand represented by a voice recognized is suited to a currentoperation status of the audio apparatus. The audio apparatus is a CDplayer for example. The control section executes a command only when itis determined that the command representing a voice recognized is suitedto the current operation status of the audio apparatus.

The aforementioned voice recognition technology, however, generally hasa high possibility of failure in recognizing voices.

For this reason, with the aforementioned conventional AV apparatus withthe voice recognition function, it is possible that the apparatusmistakes the command uttered by the user for another by erroneousrecognition and executes the wrong command. In the case of a televisionreceiver for example, it is possible that the current channel is changedto channel 8 in spite of a command given by the user to increase thesound volume. In the case of an optical disk device or a HDR, it ispossible that reverse is performed in spite of a command given by theuser to perform replay.

When the wrong command is executed, the user has to recall a lastsetting in order to correct the wrong command and utter a command torestore the last setting. Here, settings of the television receiver aremeant to include a channel setting, sound volume setting, and the like.Settings of the optical disk device or HDR are meant to include settingsfor-trick plays, a location on the medium at which a trick play settingis active currently, and the like.

In this case, the user has to seek the last setting because the userseldom bears the last setting in mind. In the case of the televisionreceiver for example, the user has to seek a last channel setting bychanging channels sequentially. In the case of the optical disk deviceor HDR, the user has to seek a last setting to find out the lastlocation on the medium by fast-forwarding with the last image or thelike as a clue.

In addition, even when the user utters a command to restore the lastsetting, it is possible that the apparatus again fails to recognize thevoice representing that command.

As described above, any one of the conventional AV apparatus with thevoice recognition function has made the user feel troublesome inrestoring a last setting.

In the case of the audio apparatus controller with the voice recognitionfunction described in the aforenoted Japanese Patent Laid-OpenPublication No. HEI 6-202690, even if a command represented by a voicerecognized is suited to a current operation status of the audioapparatus, the voice recognized does not necessarily meet the commandintended by the user. Therefore, it is possible that a command notintended by the user is executed. For this reason, like theabove-described conventional AV apparatus with the voice recognitionfunction, the audio apparatus controller with the voice recognitionfunction has made the user feel troublesome.

A feature of the present invention is to provide an AV apparatus with avoice recognition function which is capable of restoring a last settingeasily.

SUMMARY OF THE INVENTION

An audio-visual (AV) apparatus with a voice recognition functionaccording to the present invention includes a voice input section towhich voices are inputted, and a voice recognition section configured torecognize the voices. The voice input section comprises a microphone forexample. The AV apparatus is a television receiver, an optical diskdevice or a HDR for example.

The AV apparatus also includes: a code storage section having prestoredtherein a control code associated with a voice representing a command tocontrol an apparatus body, and a specified code associated with a voicerepresenting an UNDO command to undo the command; and a recognized voiceprocessing section configured to fetch the control code or the specifiedcode associated with the voice recognized by the voice recognitionsection from the code storage section and output the control code or thespecified code thus fetched. Preferably, the UNDO command is such asimple word that can easily be voice-recognized correctly withoutfailure. For example, such a simple word is “UNDO”.

The AV apparatus with the voice recognition function further includes: acontrol section configured to instruct components of the apparatus bodyto perform respective operations for executing the command according tothe control code associated therewith upon receipt of the control codeoutputted from the recognized voice processing section; and a settingsstorage section configured to store therein settings of the apparatusbody.

The control section records in the settings storage section settings ofthe apparatus body that are active before the execution of the commandas last settings before command execution and then executes the command.When the specified code outputted from the recognized voice processingsection is received after execution of the command, the control sectioninstructs the components of the apparatus body to perform respectiveUNDO operations for restoring the last settings stored in the settingsstorage section from the current settings of the apparatus body.

This configuration accommodates to cases where the AV apparatus with thevoice recognition function mistakes a command uttered by the user foranother command by erroneous recognition and executes the wrong command.

In such a case, the user utters “UNDO” as the UNDO command. In responseto the UNDO command received, the AV apparatus restores the lastsettings that have been active before the execution of the wrong commandfrom the current settings.

The foregoing and other features and attendant advantages of the presentinvention will become apparent from the reading of the followingdetailed description of the preferred embodiments in conjunction withthe accompanying drawings.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 is a conceptual view illustrating a state where the user utters“UNDO” as the UNDO command with respect to a digital television receiverthat has mistaken a command from the user for a wrong command byerroneous recognition;

FIG. 2 is a block diagram showing a basic configuration of a digitaltelevision receiver according to a first embodiment of the presentinvention;

FIG. 3 is a table showing correspondences between words stored in astorage section 34 and control codes associated with the respectivewords;

FIG. 4 is a flowchart of process steps of an operation performed by acontrol section of the digital television receiver according to thefirst embodiment of the present invention;

FIG. 5 is a conceptual view illustrating a state where the user utters“UNDO” as the UNDO command with respect to an optical disk device thathas mistaken a command from the user for a wrong command by erroneousrecognition;

FIG. 6 is a block diagram showing a basic con-figuration of an opticaldisk device according to a second embodiment of the present invention;

FIG. 7 is a table showing correspondences between words stored in astorage section 44 and control codes associated with the respectivewords;

FIG. 8 is a flowchart of process steps of an operation performed by acontrol section of the optical disk device according to the secondembodiment of the present invention;

FIGS. 9A to 9D are representations each indicating one trick playdisplayed on LCD 62A and a location on optical disk 70 at which thetrick play proceeds;

FIG. 10 is a view showing the outward appearance of a digital televisionreceiver according to a third embodiment of the present invention;

FIG. 11 is a block diagram showing a basic configuration of the digitaltelevision receiver according to the third embodiment of the presentinvention;

FIG. 12 is a view showing the outward appearance of a digital televisionreceiver according to a fourth embodiment of the present invention; and

FIG. 13 is a block diagram showing a basic configuration of the digitaltelevision receiver according to the fourth embodiment of the presentinvention.

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS

Hereinafter, a digital television receiver according to the firstembodiment of the present invention will be described in detail.

FIG. 1 is a conceptual view illustrating a state where the user utters“UNDO” as the UNDO command with respect to a digital television receiverthat has mistaken a command from the user for a wrong command byerroneous recognition.

A digital television receiver (hereinafter will be referred to as D-TVreceiver) 1 includes a CRT 16A configured to display video images, and avoice input section 30 comprising a microphone or the like for receivingvoices.

In FIG. 1, the selected channel of D-TV receiver 1 has been erroneouslychanged to channel 8 from channel 6 selected according to the lastchannel setting by erroneous recognition of a command given by the user2 to increase the sound volume.

Then, the user 2 utters “UNDO” as an UNDO command. In response to theUNDO command received, the D-TV receiver 1 puts back the current setting(channel 8) to the last setting (channel 6).

In response to the UNDO command merely uttered by the user in this way,the D-TV receiver 1 restores the last setting easily. The flow of thisoperation will be described below.

FIG. 2 is a block diagram showing a basic configuration of the digitaltelevision receiver according to the first embodiment of the presentinvention.

The D-TV receiver 1 also includes a control section 11 configured tocontrol the entire D-TV receiver 1, a tuner 12, a video/audio switchingsection 13, an OSD processing section 14, a video processing section 15,a display section 16, an audio processing section 17, a sound outputsection 18, an operating section 19, and a storage section 21 as asettings storage section.

The D-TV receiver 1 further includes a voice input section 30, a soundprocessing section 31, a voice recognition section 32, a recognizedvoice processing section 33, and a storage section 34 as a code storagesection.

The control section 11 comprises a microcomputer for example.

Upon receipt of a command to power ON/OFF, change channels or adjust asound volume or a like command inputted to the operating section 19, theoperating section 19 transfers the command to the control section 11.

The tuner 12 extracts television broadcasting signals on a currentlyselected channel from all the television broadcasting signals receivedthrough an antenna 20 and then outputs the signals thus extracted.

The video/audio switching section 13 extracts video signals, audiosignals and an electronic program guide (EPG) from the televisionsignals outputted from the tuner 12 separately from each other, anddecodes and outputs each type of signals. The television broadcastingsignals outputted from the tuner 12 are signals compressed by means ofMPEG 2 for example.

The video signals outputted from the video/audio switching section 13are inputted to the video processing section 15 via the OSD processingsection 14.

The OSD processing section 14 superimposes a character (letter) or afigure on the video signals according to a control signal transferredfrom the control section 11.

The video processing section 15, which has a D/A converter circuit forconverting video signals outputted from the video/audio switchingsection 13 to analog video signals, outputs such video signals thusconverted.

The display section 16 processes the video signals outputted from thevideo processing section 15 and displays a corresponding video image onthe CRT 16A. Such video images include, in addition to video imagesprovided by television programs, an OSD image such as “CHANNEL 8” shownin FIG. 1, and the aforementioned electronic program guide.

The audio processing section 17, which has a D/A converter circuit forconverting audio signals outputted from the video/audio switchingsection 13 to analog audio signals, outputs such audio signals thusconverted.

The sound output section 18 processes the audio signals outputted fromthe audio processing section 17 and then generates sounds to theexterior. The sound output section 18 comprises a speaker for example.

On the other side, the voice input section 30 is capable of receivingvoices uttered by the user and comprises a microphone for example. Whena voice uttered by the user or sound comprising ambient noises isinputted to the voice input section 30, the voice input section 30converts such sound or voice to audio signals and then outputs theseaudio signals to the sound processing section 31.

The sound processing section 31 eliminates noises from the audio signalsoutputted from the voice input section 30 by noise canceling and thenoutputs the thus noise-eliminated audio signals to the voice recognitionsection 32.

The voice recognition section 32 performs voice recognition by using theaudio signals outputted from the sound processing section 31 and thenoutputs a word (data) corresponding to the recognized voice to therecognized voice processing section 33.

FIG. 3 is a table showing correspondences between words stored in thestorage section 34 and control codes associated with the respectivewords.

The storage section 34 comprises ROM for example.

The storage section 34 has stored therein correspondences between wordsand their respective associated control codes in a table form.Specifically, the storage section 34 has prestored therein control codesassociated with respective of voices (words) each representing a commandsuch as to power ON/OFF, change channels or adjust the sound volume or alike command. The storage section 34 has further prestored therein aspecial code associated with a voice (word “UNDO”) representing the UNDOcommand to undo the former command.

In this embodiment, the special code are equivalent to the specifiedcode defined by the present invention.

The recognized voice processing section 33 searches the table stored inthe storage section 34 for the control code associated with the word(data) outputted from the voice recognition section 32, fetches thecontrol code, and outputs the code thus fetched. If the word outputtedfrom the voice recognition section 32 is “Channel One”, the recognizedvoice processing section 33 fetches and outputs the control code“1100000111”. Alternatively, if the word outputted from the voicerecognition section 32 is “UNDO”, the recognized voice processingsection 33 fetches and outputs the special code “0000000111”.

While this embodiment uses the word “UNDO” as the UNDO command, it ispossible to use any simple word that can be voice-recognized easilywithout failure.

FIG. 4 is a flowchart of process steps of an operation performed by thecontrol section of the digital television receiver according to thefirst embodiment of the present invention. This operation is performedin response to a command received by the control section 11.

In response to receipt of the control code outputted from the recognizedvoice processing section 33, the control section 11 records in thestorage section 21 settings of the receiver body 1 that are active justbefore execution of the command associated with the control code (stepS1) before the execution of the command and then executes the commandaccording to the control code (step S2).

The “settings of the receiver body 1 that are active just beforeexecution of the command” is meant to include channel setting and soundvolume setting that are active just before execution of the command. Thechannel setting includes settings adjusted to physical channel numbersas well as settings adjusted to virtual channel numbers.

Subsequently, the control section 11 determines whether or not the userhas uttered “UNDO” as the UNDO command (step S3). This determination isbased on whether or not the control section 11 has received the specialcode outputted from the recognized voice processing section 33.

If it is determined that the user 2 has not uttered the UNDO command,the control section 11 determines whether or not any command has beeninputted to the control section (step S4). This determination is basedon whether or not the control section 11 has received any control codeoutputted from the recognized voice processing section 33.

If it is determined that any command has not been inputted to thecontrol section 11, the control section 11 returns the process to stepS3 to continue the process.

Alternatively, if it is determined that any command has been inputted tothe control section 11, the control section 11 returns the process tostep S1 to continue the process.

If it is determined in step S3 that the user 2 has uttered the UNDOcommand, the control section 11 instructs components of the receiverbody 1 to perform respective UNDO operations for restoring the lastsettings stored in the storage section 21 from the current settings ofthe receiver body 1 (step S5).

In each of steps S2, S3 and S5, it is assumed that the selected channelof D-TV receiver 1 has been undesirably changed to channel 8 fromchannel 6 selected according to the last channel setting by the D-TVreceiver having erroneously recognized a command given by the user 2 toincrease the sound volume as a command to change the current channel tochannel 8. In response to the UNDO command received by the D-TV receiver1 upon utterance of “UNDO” by the user 2 (affirmative determination instep S3), the D-TV receiver 1 restores the last channel setting (channel6) from the current channel setting (channel 8) (step S5).

Thus, the last settings can be restored easily by mere utterance of theUNDO command. Therefore, the user is not made to feel inconvenient anylonger.

Subsequently, the control section 11 waits for input of some command(step S6) and then returns the process to step S1 to continue theprocess.

Description will be made of an optical disk device according to thesecond embodiment of the present invention.

FIG. 5 is a conceptual view illustrating a state where the user utters“UNDO” as the UNDO command with respect to an optical disk device thathas mistaken a command from the user for a wrong command by erroneousrecognition. The optical disk device 51 includes a LCD (Liquid CrystalDisplay) 62A configured to display video images, and a voice inputsection 30 comprising a microphone or the like for receiving voices.

The last trick play setting of the optical disk device 51 was “fastforward” and the location on the optical disk (medium) 70 at which thetrick play (fast forward) was active at “Episode 8, 00:15:24” (see FIG.9A to be referred to later).

However, the optical disk device 51 has mistaken a command given by theuser 2 to perform replay for another command by erroneous recognitionand is now performing “reverse” undesirably as shown in FIG. 5 (see FIG.9B to be referred to later).

Then, the user 2 utters “UNDO” as an UNDO command. In response to theUNDO command received, the optical disk device 51 puts back the currentsetting (“reverse” at the location “Episode 8, 00:14:86”) to the lastsetting (“fast forward” at the location “Episode 8, 00:15:24”) (see FIG.9C to be referred to later). Further, the optical disk device 51 stopsthe last trick play (“fast forward”) (see FIG. 9D to be referred tolater).

Thus, by mere utterance of the UNDO command by the user 2, the opticaldisk device 51 can undo the erroneously recognized command and performreplay from a location intended by the user. The flow of this operationwill be described in detail below.

FIG. 6 is a block diagram showing a basic configuration of the opticaldisk device according to the second embodiment of the present invention.The optical disk device 51 includes a pickup head 52 (hereinafter willbe referred to as PU head 52), a recording/reproducing section 53comprising a RF amplifier, an operating section 60 provided with areplay key and a record key, a display section 62, and a storage section61 as a settings storage section.

The optical disk device 51 further includes a voice input section 30, asound processing section 31, a voice recognition section 32, arecognized voice processing section 33, and a storage section 44 as acode storage section.

The aforementioned voice input section 30, sound processing section 31,voice recognition section 32 and recognized voice processing section 33are each similar to the corresponding one of the D-TV receiver 1 shownin FIG. 2.

The PU head 52 includes non-illustrated components including a laserdiode (LD), a collimator lens, a beam splitter, an objective lens, aphotodetector, a thread motor, and an actuator.

The PU head 52 is movably mounted on a shaft extending radially of theoptical disk 70. The thread motor moves the PU head 52 radially of theoptical disk 70.

The LD is a light source configured to output laser light.

The objective lens adjusts the laser light irradiation point on theoptical disk 70.

In the replay process, the PU head 52 irradiates the optical disk 70with laser light of a power for reading and detects reflected light fromthe optical disk 70 by means of the photodetector.

In the recording process, on the other hand, the PU head 52 controlslaser light of a power for recording in accordance with digital signalsinputted from the recording/reproducing section 53 to irradiate therecording surface of the optical disk 70 with laser light.

In the replay process, the recording/reproducing section 53 generates RFsignals based on the outputs of plural light-receiving elements of thePU head 52, and amplifies these RF signals. Subsequently, therecording/reproducing section 53 processes such RF signals to extract AVdata therefrom and then decodes the AV data thus extracted. The AV dataextracted here is data encoded by MPEG for example. Thereafter, therecording/reproducing section 53 converts the elongated data to analogreplay signals and then outputs these replay signals to the exterior.

In the recording process, the recording/reproducing section 53 receivesTV broadcasting signals via an antenna such as a TV antenna anddigitizes these signals.

Subsequently, the recording/reproducing section 53 encodes the digitizedAV data (by means of MPEG for example) and outputs the digitized signalsto the PU head 52.

The operating section 60 is provided for the user 2 to input variouscommands to the optical disk device 51. Such a command inputted by theuser 2 to the optical disk device 51 is transferred to the controlsection 54.

The display section 62 displays time, information on the currentsettings (including the current trick play setting and the location onthe optical disk 70 at which the current trick play is proceeding), andlike information on the LCD 62A (see FIG. 5 and FIGS. 9A to 9D to bereferred to later).

The control section 54, which comprises a microcomputer for example,controls the components of the optical disk device 51 in accordance witha command inputted by the user 2 to the optical disk device 51.

Though tracking servo control and focus servo control are actuallyexercised in the reading and recording processes, description thereof isomitted from the description of this embodiment.

FIG. 7 is a table showing correspondences between words stored in thestorage section 44 and control codes associated with the respectivewords. The storage section 44 comprises ROM for example.

The storage section 44 has stored therein correspondences between wordsand their respective associated control codes in a table form.Specifically, the storage section 44 has prestored therein control codesassociated with respective of voices (words) each representing a commandsuch as to power ON/OFF, perform replay or fast-forward or a likecommand. The storage section 44 has further prestored therein a specialcode associated with a voice (word “UNDO”) representing the UNDO commandto undo the former command.

In this embodiment, the special code are equivalent to the specifiedcode defined by the present invention.

The recognized voice processing section 33 fetches the control codeassociated with a word (data) outputted by the voice recognition section32 from the table stored in the storage section 44, and then outputs thecode thus fetched to the control section 54. If the word outputted fromthe voice recognition section 32 is “REPLAY”, the recognized voiceprocessing section 33 fetches and outputs the control code “1110000111”.Alternatively, if the word outputted from the voice recognition section32 is “UNDO”, the recognized voice processing section 33 fetches andoutputs the special code “0000000111”.

FIG. 8 is a flowchart of process steps of an operation performed by thecontrol section of the optical disk device according to the secondembodiment of the present invention. This operation is performed inresponse to a command received by the control section 54. In response toreceipt of any control code outputted from the recognized voiceprocessing section 33, the control section 54 determines whether or notthe command associated with this control code is a stop command (stepS101). This determination is based on whether or not the control codereceived is “1110111111” representing the stop code (see FIG. 7).

If it is determined that the command received is the stop command, thecontrol section 54 determines whether or not recording is proceedingcurrently (step S110).

In the case of affirmative determination in each of steps S101 and S110,it is assumed that the control section 54 receives the stop commandduring the recording process. In this case, the control section 54 doesnot immediately execute the stop command and continues recording in thebackground for a fixed time period (step S113 to be described later)while displaying information to the effect that the stop command hasbeen received on the LCD 62A (step S111 to be described later). If thereis no utterance of the UNDO command within the fixed time period(negative determination in step S112 to be described later), the controlsection 54 stops recording (step S114 to be described later).Alternatively, if there is an utterance of the UNDO command within thefixed time period (affirmative determination in step S112 to bedescribed later), the control section 54 disregards the stop command(step S115 to be described later).

If the user 2 utters any command during a trick play other thanrecording in step S101 or S110, the control section 54 advances theprocess to step S102 to continue the process.

FIGS. 9A to 9D are representations each indicating one trick playdisplayed on the LCD 62A and a location on the optical disk 70 at whichthe trick play proceeds. Specifically, FIG. 9A is a representationindicating a trick play proceeding prior to the utterance of a command(to perform replay) by the user 2 and a location on the optical disk 70at which the trick play proceeds at that time. FIG. 9B is arepresentation indicating a trick play proceeding after the utterance ofthe command (to perform replay) by the user 2 and a location on theoptical disk 70 at which the trick play proceeds at that time. FIG. 9Cis a representation indicating a trick play caused to proceed by theUNDO operation of the optical disk device 51 in response to an utteranceof the UNDO command by the user 2 and a location on the optical disk 70at which the trick play proceeds at that time. FIG. 9D is arepresentation indicating a trick play (stop) caused to proceed afterthe UNDO operation of the optical disk device 51 and a location on theoptical disk 70 at which the trick play proceeds at that time.

If it is determined in step S101 that the command received is not thestop command, or if it is determined in step S110 that recording is notproceeding currently, the control section 54 records in the storagesection 61 settings of the device body 51 that are active just beforeexecution of the command prior to the execution of the command (stepS102).

The expression “settings of the device body 51 that are active justbefore execution of the command” is meant to include a trick playsetting that is active just before the execution of the command and alocation on the optical disk 70 at which the trick play proceeds at thattime. Such trick plays are operations of the device body and include“REPLAY”, “SLOW REPLAY”, “FAST FORWARD”, “REVERSE”, “PAUSE”, and “STOP”for example. The “location” is a location of access on the optical disk70. More specifically, the “location” is a location on the optical disk70 irradiated with laser light, or a location at which the optical axisof laser light intersects the recording surface of the optical disk 70.An example of such a location is “PANDAMAN, EPIDODE 8, 00:15:24”displayed on the LCD 62A as shown in FIG. 9A.

In the case of step S102, it is assumed that the settings of the devicebody 51 that are active just before the execution of the command aresettings (“FAST FORWARD” and “PANDAMAN, EPISODE 8, 00:15:24”) as shownin FIG. 9A. In this case, the control section 54 records “FAST FORWARD”and “PANDAMAN, EPISODE 8, 00:15:24” in the storage section 61 (stepS102).

Subsequently, the control section 54 executes the command received (stepS103).

If the user 2 utters any other command than the stop command, forexample, “REPLAY” during recording, the control section 54 disregardsthat command (step S103).

In the case of step S103, it is assumed that the optical disk device 51has failed in voice recognition of, for example, the command (“REPLAY”)uttered by the user 2 and the control section 54 causes “REVERSE” toproceed (step S103). In this case, the settings of the device body 51are changed from, for example, the settings (“FAST FORWARD” and“PANDAMAN, EPISODE 8, 00:15:24”) shown in FIG. 9A to the settings(“REVERSE” and “PANDAMAN, EPISODE 8, 00:14:46”) shown in FIG. 9B.

Subsequently, the control section 54 determines whether or not the user2 has uttered the UNDO command (step S104). This determination is basedon whether or not the control section 54 has received the special codeoutputted from the recognized voice processing section 33.

If it is determined that the user 2 has not uttered the UNDO command,the control section 54 determines whether or not any command has beeninputted to the control section 54 (step S105). This determination isbased on whether or not the control section 54 has received any controlcode outputted from the recognized voice processing section 33.

If it is determined that any command has not been inputted to thecontrol section 54, the control section 54 returns the process to stepS104 to continue the process.

Alternatively, if it is determined that any command has been inputted tothe control section 54, the control section 54 returns the process tostep S101 to continue the process.

If it is determined in step S104 that the user 2 has uttered the UNDOcommand, the control section 54 instructs components of the device bodyto perform respective UNDO operations for restoring the last settingsstored in the storage section 61 from the current settings of the devicebody (step S106).

Here, it is assumed that the optical disk device 51 receives the UNDOcommand “UNDO” uttered by the user 2 (affirmative determination in stepS104) and then restores the last settings from the current settings(step S106). In this case, the settings of the device body 51 arechanged from, for example, the settings (“REVERSE” and “PANDAMAN,EPISODE 8, 00:14:46”) shown in FIG. 9B to the settings (“FAST FORWARD”and “PANDAMAN, EPISODE 8, 00:15:24”) shown in FIG. 9C.

However, the settings (“FAST FORWARD” and “PANDAMAN, EPISODE 8,00:15:24”) resulting from the UNDO operation cause “FAST FORWARD” to beperformed thereby changing the location on the optical disk 70. If theuser 2 utters any command during fast forward, it is highly possiblethat the user 2 intends to stop fast-forwarding and wants the device toperform “REPLAY”, “PAUSE” or a like trick play at the stopped location.

In order to prevent the stopped location from changing, the controlsection 54 performs steps S107 and S108 to be described later.

First, the control section 54 determines whether or not the trick playrecorded in the storage section 61 is of the type which causes thelocation of access on the optical disk 70 to change like such a trickplay as “REPLAY”, “SLOW REPLAY”, “FAST FORWARD” or “REVERSE” (stepS107).

If it is determined that the trick play is of the type which causes thelocation of access on the optical disk 70 to change, the control section54 instructs the components of the device body to “STOP” (step S108),waits for input of any command (step S109), and then returns the processto step S101 to continue the process.

By step S108, the settings of the device body 51 are changed from, forexample, the last settings (“FAST FORWARD” and “PANDAMAN, EPISODE 8,00:15:24”) shown in FIG. 9C to the settings (“STOP” and “PANDAMAN,EPISODE 8, 00:15:24”) shown in FIG. 9D.

Thus, the location of access fails to change after the UNDO operationand, hence, the optical disk device 51 can perform “REPLAY” or a liketrick play from the location intended by the user.

If it is determined in step S107 that the trick play is not of the typewhich causes the location on the optical disk 70 to change, i.e., if thetrick play is “STOP” or “PAUSE”, the control section 54 waits for inputof any command (step S109), and then returns the process to step S101 tocontinue the process.

Description will be made of a case where the optical disk 70 set on theoptical disk device 51 is of the rewritable type and the optical diskdevice 51 has performed “RECORDING” not intended by the user 2 becauseof erroneous recognition of a command in step S103. If it is determinedthat the user 2 has uttered the UNDO command (affirmative determinationin step S105), the control section 54 stops “RECORDING” and performs thelast trick play to erase the undesirably recorded contents from theoptical disk 70 for restoring the last settings (step S106).

Alternatively, if it is determined in step S110 that recording isproceeding currently, the control section 54 causes the LCD 62A todisplay information to the effect that the command to stop “RECORDING”has been received (step S111). Such information is “The command to stoprecording has been received. The command will be executed in severalseconds.” or “Recording is now going to stop.”

Subsequently, the control section 54 determines whether or not the user2 has uttered the UNDO command (step S112). This determination is madein the same manner as in step S104. In the case of step S112, it isassumed that the user 2, informed from the display to the effect thatthe command to stop “RECORDING” has been received, utters the UNDOcommand to undo the recording stop command or does not utter the UNDOcommand because the user 2 intends to stop recording.

If it is determined that the user 2 has not uttered the UNDO command,the control section 54 determines whether or not a fixed time period hasbeen elapsed from the time the information about the receipt of thecommand to stop “RECORDING” was displayed on the LCD 62A (step S113).The fixed time period is previously set to a time period sufficient toallow the user 2 to utter the UNDO command after having been informed ofthe receipt of the recording stop command. The fixed time period is 10seconds for example.

After lapse of the fixed time period, the control section 54 executesthe aforementioned stop command (step S114). Thereafter, the controlsection 54 waits for input of any command (step S109). In response tothe input of any command, the control section 54 returns the process tostep S101 to continue processing.

Alternatively, if it is determined in step S112 that the user 2 hasuttered the UNDO command, the control section 54 disregards theaforementioned stop command (step S115). Thereafter, the control section54 waits for input of any command (step S109). In response to the inputof any command, the control section 54 returns the process to step S101to continue processing.

While the optical disk device 51 according to this embodiment has beendescribed as a representative of AV apparatus with the voice recognitionfunction, the present invention is applicable to hard disk recorderswith the voice recognition function.

A digital television receiver according to the third embodiment of thepresent invention will be described below.

FIG. 10 is a view showing the outward appearance of the digitaltelevision receiver according to the third embodiment of the presentinvention. FIG. 11 is a block diagram showing a basic configuration ofthe digital television receiver according to the third embodiment of thepresent invention.

The D-TV receiver 101 shown in FIGS. 10 and 11 is different from theD-TV receiver 1 shown in FIG. 2 in that the voice input section 30 islocated exteriorly of the cabinet of the D-TV receiver. The operation ofthe D-TV receiver 101 shown in FIGS. 10 and 11 is the same as that ofthe D-TV receiver 1 shown in FIG. 2.

A digital television receiver according to the fourth embodiment of thepresent invention will be described below.

FIG. 12 is a view showing the outward appearance of the digitaltelevision receiver according to the fourth embodiment of the presentinvention. FIG. 13 is a block diagram showing a basic configuration ofthe digital television receiver according to the fourth embodiment ofthe present invention.

The D-TV receiver 201 shown in FIGS. 12 and 13 is different from theD-TV receiver 1 shown in FIG. 2 in that the D-TV receiver 201 isprovided with a remote controller 100 for remote control of the D-TVreceiver 201 and a receiving section 23 for receiving a control code orspecial code transmitted from the remote controller 100 (transmittingsection 35).

The remote controller 100 includes a voice input section 30, a soundprocessing section 31, a voice recognition section 32, a recognizedvoice processing section 33, a storage section 34, and transmittingsection 35 configured to transmit a control code or special codeoutputted from the recognized voice processing section 33.

With this embodiment, the user 2 utters voices against the voice inputsection 30 of the remote controller 100.

In this embodiment, the D-TV receiver 201 and the remote controller 100,as a whole, are equivalent to the AV apparatus defined by the presentinvention.

The operation of the D-TV receiver 201 shown in FIGS. 12 and 13 is thesame as that of the D-TV receiver 1 shown in FIG. 2.

While only certain presently preferred embodiments of the presentinvention have been described in detail, as will be apparent for thoseskilled in the art, certain changes and modifications may be made inembodiments without departing from the spirit and scope of the presentinvention as defined by the following claims.

1. An audio-visual apparatus with a voice recognition functioncomprising: a voice input section to which voices are inputted: a voicerecognition section configured to recognize the voices; a code storagesection having prestored therein a control code associated with a voicerepresenting a first command to control an apparatus body, and aspecified code associated with a voice representing an UNDO command toundo the first command; a recognized voice processing section configuredto fetch the control code or the specified code associated with thevoice recognized by the voice recognition section from the code storagesection and output the control code or the specified code thus fetched;a control section configured to instruct components of the apparatusbody to perform respective operations for executing the first commandaccording to the control code associated therewith upon receipt of thecontrol code outputted from the recognized voice processing section; anda settings storage section configured to store therein settings of theapparatus body, wherein: the control section records in the settingsstorage section settings of the apparatus body that are active beforethe execution of the first command as last settings before commandexecution and then executes the first command; when the specified codeoutputted from the recognized voice processing section is received afterthe execution of the first command, the control section instructs thecomponents of the apparatus body to perform respective UNDO operationsfor restoring the last settings stored in the settings storage sectionfrom the current settings of the apparatus body; the control sectiondetermines whether a second command executed after instructing thecomponents of the apparatus body to perform the respective UNDOoperations, is FAST FORWARD or REVERSE; and when determining that thesecond command is FAST FORWARD or REVERSE, the control section instructsthe components of the apparatus body to stop the second command.
 2. Theaudio-visual apparatus according to claim 1, which is an optical diskdevice with a voice recognition function, or a hard disk recorder with avoice recognition function.
 3. The audio-visual apparatus according toclaim 1, wherein the control section records in the settings storagesection the last settings before first command execution including lastchannel setting that is active before the execution of the firstcommand, last sound volume setting that is active before the executionof the first command, last setting of the second command that is activebefore the execution of the first command, and a location on a medium atwhich the second command is executed before the execution of the firstcommand.
 4. The audio-visual apparatus according to claim 1, furthercomprising a remote controller, the remote controller having the voiceinput section, the voice recognition section, the code storage section,the recognized voice processing section, and a transmitting sectionconfigured to transmit the control code and the specified code outputtedby the recognized voice processing section.
 5. The audio-visualapparatus according to claim 1, further comprising a display section,wherein when the control section receives, during the recording process,the control code which represents a stop command which is one of thefirst command, the display section displays a receipt of the stopcommand, the control section: continues recording even if receiving thestop command during the recording process; stops recording if notreceiving the specified code within a fixed time period displayed on thedisplay section; and disregards the stop command if receiving thespecified code within the fixed time period.